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Abstract 

We describe the Bayesian-based signal-to-noise eigenmode method for cosmological pa- 
rameter estimation, show how it can be used to optimally compress large CMB anisotropy 
data sets to manageable sizes, and apply it to the DMR 4-year, South Pole and Saskatchewan 
data, individually and in combination. A simple prior probability method is used to in- 
clude large scale structure observations. Estimates of the Hubble parameter, vacuum 
energy density, baryon fraction and primordial spectral tilt derived from the combined 
data are given. 



1 Introduction 

As CMB anisotropy experiments have gotten more ambitious, our need for powerful statistical 
methods has become more urgent. For the CMB data sets that have been obtained up to now, 
including COBE (dmr4), the 1994 South Pole data of || (sp94) and the 1993-95 Saskatoon 
data of |3|, U (sk95), which we analyze jointly here, it is possible to do a relatively complete 
Bayesian statistical analysis if the primary anisotropies are assumed to be Gaussian and the non- 
Gaussian Galactic foregrounds are not large. Even for these experiments, this is feasible only 
because of compression, in which the data set is acted upon by linear operators which project 
it onto subspaces of the full data. In the past the linear combinations of the data were defined 
by what made intuitive sense (e.g., weighted sums of different frequency channels or weighted 
averages of pixel separations below the beam scale), and could still leave too many pixels to 
deal with in a complete analysis. Here we use the rigorous signal-to-noise eigenmode approach 
to data compression || to reduce the sets to the manageable < 1000 important combinations. 
As we approach the era of megapixel data sets promised by MAP and COBRAS/SAMBA, 



via the era of tens of thousands of pixels promised by long duration balloon experiments, the 
question of how to come as near to optimal compression as possible given computer limitations 
becomes of paramount importance. This happy day of too many pixels is now upon us. 

The goal is to estimate the parameters {jja} of a target set of theories with angular power 
spectra by first determining the likelihood function C{{ua}) for each theory, and then 
comparing the likelihoods as a function of the parameters. Using Bayes' theorem, we can 
write the probability of the theoretical parameters, given the observations and the class of 
theories being tested, P({y a}\OBS, TH) oc C({yA\)P({yA}\w' lOT )i where the assumed prior 
probability distribution P{{yA\\w' 101 ) can reflect a priori maximal ignorance, or take into 
account constraints from other information such as large scale structure observations. The 
proportionality constant is related to the probability that the class of theories is correct given 
the observations. To give preferred values and errors for a specific cosmological parameter 
of interest such as the Hubble parameter, one often integrates (marginalizes) over the other 
parameters, such as the density fluctuation power amplitude on cluster scales, <t 8 , and the 
primordial density fluctuation spectral index n s . 

In this paper, we assume the inflationary model for structure formation with Gaussian 
adiabatic (scalar) density perturbations and possibly gravitational wave (tensor) perturbations. 
We explore constraints in the parameter space {t , h, Q tot , f^n 2 , fl vac , flhdm, ^cdm, v s ,Vt,o-g}. We 
assume reheating occurs sufficiently late to have a negligible effect on Ce. The total density 
parameter, Q to t = + ^cdm + Qhdm + ^vac, is expressed in terms of the densities in baryonic 
cold, hot and vacuum matter, of which Q nr = Qb + ^cdm + ^hdm can cluster. The age of the 
Universe is to an d h is the Hubble parameter in units of 100 km s -1 Mpc -1 ; h£ is a function 
of {fi mc , Qtot}, so one parameter is redundant. The scalar and tensor tilts are v s = n s — 1 and 
z/ 4 .0 With current errors on the data, this space is too large for effective parameter estimation. 
Instead we restrict our attention to various subregions, such as {<jg, n s , h | fixed t , f^lr 2 }, where 
either Q t ot — 1 and VL vac is a function of ht or Q vac = and Q t ot is a function of ht . The to's 
we choose are (11, 13, 15 Gyrs) with Q vac > 0, but Q t ot = 1- For these cases, the "standard" 
nucleosynthesis value f^h 2 = 0.0125 was chosen. We constrain 0.5 < n s < 1.5, 0.5 < h < 1. 
For VL tot = 1 models, we have roughly VL vac (h) ~ 0.9[0.3(h/h a - l) 3 + 0.7(h/h a - l) - 4 ], where 
hi = 0.5(13Gyr/t )- A recent estimate for globular cluster ages is U.6±\i Gyr §. For the case 
Qvac — 0, we have also let fi^h 2 vary, over the range 0.003125 < fi^h 2 < 0.05, and we have also 
done a limited exploration of the 13 Gyr, Q t ot < 1, ^vac = 0, space with tilt, using C' £ s from 
Bond and Souradeep. The reason for restricting the paths through parameter space is because 
of the length of time required for a complete statistical treatment of each data set per model 

e £ . 

In Fig. 0, the bandpowers M associated with current experiments are compared with some 
of the Ces in the parameter space we are exploring, here the 13 Gyr, H = 50, tilted sequence 
with gravity waves included and the 13 Gyr fl vac > sequence with n s = 1, with amplitude 
normalized to best-fit the 4-year DMR data. In both cases, f^li 2 = 0.0125. The curves are 
very similar if we allow for a mix of hot and cold dark matter with the same as these 
CDM models, and the other parameters fixed. The solid dark curve is the "standard" untilted 
CDM model. The bandpowers for the three experiments analyzed here, dmr4, sp94, sk95, are 

x Ci = £(£ + l)C£/(27r), where = (|a^ m | 2 ) is the CMB power spectrum as usually defined and the ag m are 
the spherical harmonic coefficients of the temperature fluctuations for the theoretical signal. 

2 The ratio of gravitational wave power to scalar adiabatic power is Vcw/'Pad ~ (— 4^t)/(l — apart 
from small corrections. This determines the level of tensor anisotropies compared with scalar. The tensor tilt is 
related to the deceleration parameter q of the Universe during inflation by v t j2 i=s l + q~ x plus small corrections. 
Here, we take one of two cases. (no-GW case): v t — 0, thus no gravity wave contribution (for nearly critical 
acceleration, q i=s —1, as in natural inflation). (GW case): v t — v s if the scalar tilt is negative (subcritical but 
nearly uniform acceleration) and vt = if the scalar tilt is positive. 



the darker heavier data points. Because of the differing angular scales involved we gain a long 
lever arm with which we can constrain cosmological parameters more strongly than with any 
individual experiment. The lower panels in the figures are closeups of the first and second 
"Doppler peak" regions. Fig. || gives the best fit models, described in § ||. 



2 Signal-to-Noise Eigenmode Method 



We are given the data in the form of a measured mean A p of the anisotropy in the pth 
pixel, along with the variance about the mean a D for the measurements. In general, there 
are pixel-pixel correlations in the noise, defining a correlation matrix Cr> PP > with off-diagonal 
components as well as the diagonal a 2 Dp = Cd pp . Also there is usually more than one fre- 
quency channel, with the generalized pixels having frequency as well as spatial designations. 
The theoretical signal also has a correlation matrix, Ct pp >, which is a linear combination of a 
product of the Ce times a "window function matrix" W PP >{1) encoding the possibly frequency- 
dependent beam, the chopping strategy, sky coverage, etc. for the experiment: Ct pp ' = 
J2eC e W pp i(£)(£ + + 1)] (see e.g., 0). The "window function" usually reported for an 

experiment is W t = (1/N pix ) J2 P =i W pp {£). 
The likelihood function is 

\nC({y A }) = -\~E\C n + C T )~ l A - \ lndet(C n + C T ) - N pix In y^F. (1) 

Here f denotes transpose. The noise correlation matrix C n = Cd + C res consists of the pixel 
errors Cd and the correlation of any unwanted residuals C res , such as Galactic or extragalactic 
foregrounds. One can think of C res as increasing the noise for selected correlation patterns in 
the medium. With a large enough noise in these patterns, they are effectively projected out 
from the data. 

Constraints such as averages, gradients (dipoles, quadrupoles) and known spatial templates, 
which may be frequency dependent (e.g., IRAS or DIRBE combined with appropriate extrapo- 
lations) can also be modelled in the total A p , as "nuisance variables" to be integrated (marginal- 
ized) over. Denoting each constraint c on pixel p by K c T pc , where the template for constraint 
c is T pc and the amplitude is k c , we need only replace A p in eq. (JT|) by A p — J2 C Y pc K c , then 
integrate over the amplitudes k c , assuming some prior probability distribution. This is most 
easily done if we assume the k c are distributed as very broad Gaussians, reflecting our ignorance 
of their values (or, if we know their likely range, incorporating that as prior information in the 
Gaussian spreads). The integration over k c then yields eq.([TJ) with the residual noise matrix 
given by C res = TKT\ where K cc i = (k c k c >) is the assumed prior variance for the constraint 
amplitudes. As the eigenvalues of K become very large, the effect of the constraint matrix 
is to project onto the data subspace orthogonal to that spanned by T pc . Although one can 
directly use the likelihood equation in this projection limit (using 5^ Nc '(k) for the constraint 
prior), it is computationally simpler to use the Gaussian prior. (Taking into account constraints 
with amplitudes that are not linear multipliers times the template is much more complex.) A 
suitable C res can also allow us to focus attention only on a specified band in £-space for power 
spectrum estimation. 

In practice, we do not compute the quantities A^ (CV + Cn)" 1 A and det[CV + C n ] di- 
rectly; instead we go to a basis (i.e., linear combination of the data) in which CV and C n 
are diagonal. First, we whiten the noise matrix using the nonorthogonal transformation pro- 
vided by its "Hermitian square root," C n — > C^ l ^ 2 C n C^ 1 / 2 = I; we apply the same trans- 
formation to Ct and diagonalize this in turn with the appropriate matrix of eigenvectors, R: 
C T -> RC- l,2 C T C- l l 2 R) = diag(£ fc ), which has units of (S/N) 2 . We then transform the data 
into the same basis, A -> RC^A = £, now in units of (S/N). The transformed theory matrix 
£ still depends on the theoretical amplitude (erg, etc.) as a simple multiplier, which enables the 
likelihood to be easily calculated as a function of this parameter. In the new basis, the noise 
and signal have diagonal correlations and = 1 + so £ fc is useful as a theory- dependent 
S/N power spectrum which gives a valuable picture of the data and shows how well the target 
theory fares (Fig. |)§, |, ||. 
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Figure 1: 13 Gyr sequences, varying n s with 
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^vac = 0, and varying Hq, hence £l va c, with n s = 1. 
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Figure 2: Best-fit models using DMR+SP94+SK95 on the left, and adding LSS on the right, for 
11, 13 and 15 Gyr A sequences. The left also shows 13 Gyr open and baryon sequence best fits. 
Parameters defining these models are given in Table 2. 



The modes are sorted in order of decreasing ^/TV-eigenvalues, so low fc-modes probe the 
theory in question best. This expansion is a complete (unfiltered) representation of the map. 
The optimal method for data compression is to use sharp signal-to-noise filtering, keeping only 
those high S/N modes with k < k cut and deleting low S/N ones. We also find it extremely useful 
to look carefully at the power in the low S/N modes to determine whether further residuals 
need to be added to the generalized noise: a poor model for the noise can give false indications 
of what the data is saying and misrepresent the signal. Filtering using S/N-modes has a long 
history in signal processing where it is called the Karhunen-Loeve method fllPf , and it is now 
being widely adopted for analysis of astronomical databases. 

For an all-sky experiment with uniform, uncorrelated pixel variances, the eigenmodes are the 
spherical harmonics, and the eigenvalues the expected coefficients ai m . For a more complicated 
experiment, the high-SyiV modes probe the peak of the experiment's window function in £-space. 
Low-S/N modes are more complex. For experiments with more than one frequency channel, 
differences between channels should show no CMB signal, and so the eigenvalue should be 0. 
Nearby pixels, oversampling the beam, should also show very little signal — the smooth fall from 
high S/N to low traces the beam in much the same way that the window function falls as a 
Gaussian oc exp[— £ 2 /£ 2 ] at high £. We expect these low S/N modes to be largely independent 
of the theory used to calculate the appropriate CV, which enables these modes to be used as a 
diagnostic of both the analysis procedure and the experiments themselves. 

3 The Experiments Analyzed 

We now discuss the anisotropy experiments we use. The six COBE/DMR four-year maps 
are first compressed into a (A+B) (31+53+90 GHz) weighted-sum map, with the customized 
Galactic cut advocated by the DMR team, basically at ±20° but with extra pixels removed in 
which contaminating Galactic emission is known to be high, and with the dipole and monopole 
removed. Galactic coordinate pixels are used; slight differences arise with ecliptic coordinate 
pixels. Although one can do full Bayesian analysis with the map's (2.6 ) 2 pixels, this "resolu- 
tion 6" pixelization of the quadrilateralized sphere is oversampled relative to the COBE beam 
size, and there is no effective loss of information if we do further data compression by using 
"resolution 5" pixels, (5.2 ) 2 || p]]. The weighted sum of channels is an exact use of our optimal 
signal-to- noise compression. The resolution degradation is not optimal but is nearly so (Ct{0) 
is nearly constant for separations 9 below the scale of the beam, so adjacent pixel differences 
have tiny signal but the usual data-noise). The Galactic cut is also not optimal, but could 
be made so by using explicit templates for Galactic foregrounds to include in C n , as described 
above. The combined effect reduces the pixel number from 6 x 6144 to 999. Further compres- 
sion by a factor of two or so is possible without much information loss |5| . A strong indication 
of the robustness of the dmr data set is the insensitivity of the band-powers to the degree of 
signal-to- noise filtering and to which frequencies are probed. For C res , we include templates for 
the monopole, dipole and quadrupole, the latter allowing for a Galactic foreground contami- 
nant, which we know is there at low i in the 31 GHz channel. The DMR data probes i ~ 2 — 15 
well, with useful information out to ~ 30. 

The sp94 experiment || is similar to a classic single-differencing chopping experiment, 
except that differencing is associated with the oscillation of the beam about the pixel position. 
It probes i ~ 30 — 120. The number of frequency channels and spatial pixels is sufficiently small 
(301) that no compression is needed: all 7 frequencies in the Ka and Q bands at ~ 30 and ~ 40 
GHz are simultaneously analyzed. There are 14 constraints, average and gradient removals 
for each frequency. Taking differences in AT/T in frequency at the same spatial position is 



insensitive to the primary signal but has the usual pixel noise for each channel, so S/N filtering 
would tend to remove those modes and strong compression would result. Because the beams 
do vary somewhat with frequency, however, the compression would remove some information, 
unlike for COBE. 

The sk95 experiment H probes a much larger band in £-space, from ~ 50 to ~ 400. Even 
before the data was delivered to us a significant amount of frequency and spatial compression 
already took place. In this paper, for parameter estimation, we use the "CAP" data (2016 pixels, 
including rebinned data from sk94, with 48 constraints associated with average removals). The 
SK experiment measured the temperature directly by making slightly-curved radial scans from 
the North Celestial Pole about 8 degrees in length, which covered the CAP as the earth rotated. 
The data was binned in RA, but, instead of binning in declination, it was projected in software 
onto what are in effect 3 to 19 beam "chopping" configurations. Adding the RING data to 
the CAP, involving sweeps in a ring around the NCP at ~ 8° brings the total to 2400 pixels, 
with 52 constraints. In 0, we show that the "CAP" and "CAP+RING" parameter estimates 
agree to much better than "one sigma" even though the RING adds substantially more data. 
One potential concern is that only one HEMT band is represented in the data. We have also 
extensively analyzed the SK94 data set on its own ||, with 1344 pixels and 28 constraints, 
which has only 3 to 9 beam template projections and substantially fewer hours of integration 
than the 94+95 data, but the advantage of having both Ka and Q band information so the 
frequency spectrum can be checked. We agree with || that the spectrum of the SK94 3 to 9 
templates is consistent with a CMB origin, and inconsistent with likely Galactic foregrounds. 
(We come to the same conclusion for the sp94 data, in agreement with 0.) SK95 had data 
only from the Q-band. 

In analyzing SP94 and SK94-95, it is essential to include errors in the overall calibration 
of AT. For SK94-95 it is estimated to be a Gaussian with standard deviation e = 0.14; for 
SP94, e = 0.10. Let C (cr s ) denote the calculated likelihood assuming no such errors; then 
the likelihood with the uncertainty included is £(cr 8 ) = / da' 8 exp[— (erg — a' 8 ) 2 / (2e 2 al)]Co(a' 8 ) . 
It is unfortunate that after all of the effort that has gone into these superb experiments, an 
astronomical issue like the brightness of Cas A (for SK95) results in a substantially poorer 
constraint on a 8 than one obtains assuming no such calibration uncertainty. 

4 Phenomenology, S/N Power Spectra and Data Compression 

As we mentioned in § [l], we have chosen to order our path through parameter-space using the 
cosmological age of the models, t^. To examine the phenomenology of the experiments we shall 
use a one-parameter sequence of Ci shapes, with the overall bandpower of the experiment (or 
the related as) as another parameter. While it was usual in the past to use a power law in 
C £ , ~ (£ + ~) UAT [g, it is evident from Fig. | that this would be a very bad fit to the sk95 
data, although it is a reasonable representation over the limited I range for both the dmr4 and 
sp94 data. The sequence we use is the first panel in Fig. [I], the tilted CDM sequence for the 
standard CDM model, i.e., H = 50, Q = 1, Qb = 0.05, with n s variable. We use the GW case, 
i.e., u t — n s — 1 if n s < 1, v t — otherwise. These models have an age of t = 13Gyr. 

Fig. | shows 1,2,3,... sigma contours of C(a 8 ,n s ), with z/-sigma defined by C/C max = 
exp[— 1/ 2 /2]. It is clear from the right hand panel that fixing n s — 1 and to — 13, but varying h, 
and therefore VL vaC) is not a good sequence to use for phenomenology since there is very little 
difference in the C^'s as h varies. The Fig. || £(<78,h) contour map shows that indeed the data 
does not determine the Hubble parameter very well. 

With the most recent experiments (N pix > 2000), the computer power required to calcu- 
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Figure 3: Contour plots of the likelihood of as and n s for fixed h = 0.5. The contours are at C = 
exp[— |{1, 4, 9, . . .}]C ma x (with an extra contour around C rnax to show where it is). In every panel, 
the lefthand as axis is for the CDM sequence, while the righthand as axis is for a £lhdm = 0.2£l nr 
mixed dark matter model, as inferred from cluster abundances is ~ 0.6. 
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Figure 4: Likelihood of as and h for fixed n s = 1. In this case, the righthand as axis strictly applies 
only for the h=0.5 line. 



late the likelihood over a sufficiently wide model space is becoming prohibitive. The S/N- 
eigenmodes also provide a form of data compression which can drastically reduce the required 
analysis time. By rotating to a basis in which some "canonical theory" with correlation matrix 
Ct* is diagonalized by the matrix R*, but only retaining some fraction of the modes, we effi- 
ciently remove parts of the data dominated by noise (i.e., modes with very low S/N <C 1), but 
retain the Gaussian character of the likelihood for the remaining modes. For other theories, the 
transformed theory matrix (R^C^'^CtC' 1 ' 2 R\) will no longer be diagonal, so the full matrix 
calculation must still be performed, but now on the smaller space of observations restricted to 
the modes with the highest "canonical" S/N. Moreover, for these theories, the S/N-modes will 
be somewhat different, so the compression will not be as efficient (i.e., we will have thrown out 
a bit "more signal and less noise"). Still, we have achieved compression as good as 90% for 
experiments (like SK94) with two channels and 65% for the full SK94-95 dataset, which has 
already been re-binned to remove some of the redundancy in beam oversampling and channel- 
to-channel differences. Because the cost of the matrix calculations involved scales as N 3 , these 
result in significant speedups: from ~ 1 hour to ~ 10 minutes per point in parameter space for 
sk95 (which is actually significantly worse than the expected (1 — 0.65) 3 ~ 1/25 speedup due 
to overhead). 

In Fig. ^ we show 1, 2, 3, . . . sigma likelihood contours for the SK94-95 CAP dataset, as in 
Fig. 0. The lower left panel superposes the contours of the k cut = 700 case upon those with all 
2016 modes included. The similarity of the contours shows that both the amplitude and the 
index determinations are not compromised by S/N cuts. For the k cu t = 500 case, contours > 2 
are very similar as well. Thus we can achieve significant degrees of compression without loss of 
information. In the following, we apply no data compression to the analysis of the DMR and 
SP data, but for SK95 we present results using the top 700 modes from the canonical standard 
CDM theory, the n s = 1 model in the to — 13 Gyr sequence. 

The reason the compression works can be understood by examining the S/N power spectra, 
shown in Fig. [5[ The curve is the theoretical spectrum 1 + £j. given by the eigenmodes, for a 
"standard CDM" model with amplitude erg = 1.2, the value suggested by COBE. The points 
are the observations with the same binning as the theory curve. (The bins require a certain 
signal-to-noise when summed, but a minimum number are required to define a bin so that 
the error bars are not too large.) The error bars contain both variances associated with the 
pixel noise and with the theoretical cosmic variance (noise-noise, noise-signal and signal-signal 
terms). To be a good fit to the data, the error bars should pass through the theory curve. 
After the top few hundred modes, the eigenvalues have S/N <C 1 so we do not expect them to 
contribute significantly to the likelihood. We emphasize that it is legitimate to use any mode 
subset: the relative likelihoods we obtain will tell us which theory is preferred for those modes. 
It is just that we do not want to build any prior prejudice for a theory by compressing the data 
in a way which may be biased in its favour over the other theories we are testing. Thus we 
choose to go far into the S/N tail, retaining 700 modes. The S/N mode formalism also can be 
used to design experiments to discriminate particular theories (e.g., Knox, these Proceedings). 

5 Combining Experiments and Parameter Estimation 

Combining experiments to get a total likelihood is straightforward. If the pixels are uncor- 
rected, either because they overlap little on the sky or in £, we only need to multiply the 
individual likelihoods together. This is the case for COBE/DMR, SP94, SK94-95. If there is 
significant overlap, then the experiments should ideally be combined and considered to be one 
larger experiment, with Ct connecting the pixels in one experiment with the pixels in the other, 
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Figure 5: Observed and theoretical S/iV-spectra with la errors (including pixel noise and cosmic 
variance), using "standard" CDM, with eg = 1.2, the DMR4 value, for the theory. 
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Figure 6: How as-n s likelihood contours change with the SK95 S/N cut: for 500, 700 and all (~ 2000) 
modes. 



although the cross-pixel Co will be zero. We have applied this to the SK95+MSAM dataset, 
in joint work with Charbonneau and Knox, but will not describe it here. 

The upper right panel of Fig. ^| shows the likelihood for the H = 50, GW, 13 Gyr, tilted 
sequence. Each experiment individually constrains the amplitude, cr 8 , better than the shape, 
n s : the window functions cover a narrow range of i. Note that the SK experiment does better 
than DMR at determining the slope. The calibration uncertainty for SK95 is the reason that 
<r 8 is not more tightly constrained. Fig. || shows the advantage offered by combining the results 
of different experiments: the long baseline in i helps considerably in localizing the n s contours. 
In Fig. f|, we see that the DMR data does not restrict the value of h, and thus not of Q vac , 
whereas the SK data does, yet the combined data focusses the a$ determination, but not the 
h determination. The SK95 data prefers more power than is predicted by the DMR data for 
"standard" models. Thus, the n s = 1, H = 50, tt B h 2 = 0.0125 model has a s = 1.20 ± 0.08 for 
DMR, while for SK95 it is a s = 1.48lo;io- Increasing fi^h 2 to 0.2 brings them closer into line, 
erg = 0.89 ±0.06 for DMR, while for SK95 it is <j$ = 0.821q ni and this model is much preferred 
statistically to the 0.0125 one. 

We could repeat the contour maps for the 11 and 15 Gyr vacuum sequences, for the fixed t 
open models, and for the variable Qb sequence, but it is more concise to quote single numbers, 
our estimates of the individual cosmological parameters. To that end, we marginalize over 
the other parameters in the sequence, assuming a prior probability for the parameters. If it 
is uniform we get the results in the left columns of Table |]. The idea that first motivated 
this project was that the SK94 and SP94 data looked sufficiently robust to return to the 
multiresolution approach combining experiments to get best possible constraints, e.g., Q], and 
this would significantly improve the COBE-only errors on n s . The SK94+SP94+DMR4 column 
is the culmination of that effort. However, the SK95 data took us to significantly higher i and 
the promise of greater discrimination among models based on how they rise to the Doppler 
peak. Notice the rather large shift in n s when we pass from the 1300 pixel 2-channel SK94 
data, which had chopping templates from 3 to 9, to the SK95 set, which had 10-19 projections 
as well as much more 3-9 data, but only for the Q-band. A worry is that non-cosmic signals at 
high i might be contaminating the 10-19 template projections. 

We now wish to add some large scale structure constraints, by constructing prior proba- 
bilities that roughly correspond to the restrictions arising from observations of galaxy clus- 
tering and cluster abundances.^ The linear power spectrum for density fluctuations is often 
characterized by the shape parameter, T « f2 nr he~^ s ( 1+nm '( 2h ) 1/ ' 2 )~ a06 ], j s q.48 for the 

standard CDM model. Here Q nr = Qb + ^cdm + Qhdm- Assuming a linear bias model for 
how the galaxy distribution is amplified over the mass distribution, the clustering data implies 
0.15 < T + ^ s /2 < 0.3. The abundance as a function of X-ray temperature also heavily con- 
strains as- Values from 0.5 up to 0.7 are obtained for Q nr = 1 CDM-like models. For Q vac > 0, 
the value is higher, scaling roughly as Q~® 56 . There are also many estimates of the combination 
(TsQnr 6 that are obtained by relating the galaxy flow field to the galaxy density field inferred 
from redshift surveys, which all take the form [b g as\ /3 g , where b g is the galaxy biasing factor and 
j3 g is a number whose value depends upon data set and analysis procedure: [|11] give 0.64 ±0.27 
for an average of a number of estimates in the literature, and 0.55 ± 0.10 for a determination 
using a maximum likelihood technique for the IRAS survey and the Mark III velocity field data 
set, while a higher (~ 0.7) number is obtained using POTENT on this data set. It is usual to 
take b g « a^ 1 for galaxies, which gives a a 8 consistent with the cluster value, but b g can depend 

3 We can also augment the prior probability with likelihoods for CMB experiments that we have neither the 
data nor the time to analyze properly, by using the quoted flat bandpower alone, along with the appropriate 
window function. Many groups are now using this technique exclusively to estimate CMB parameters; in joint 
work with Knox we show how well this extreme form of compression works. 



case 


DMR4 


SK94+SP94 


SK94-95 


LSS+DMR4 


LSS+DMR4 








+DMR4 


+SP94+DMR4 




+SK94-95+SP94 


13 Gyr Vt vac sequence, 


with GW 








#o=50 
#o=70 


n s 
n s 


1.02+^ 


0.951$ 

0.921$ 


1-121$ 


0.761$ 
0.991$ 


0.851$ 
0.991$ 


All H 


n s 






i-iiliS 


1.021$ 


1-071$ 


n s =l 


H 


< 90 


< 65 


< 65 


68 ±3 


67 ± 3 (n vac w 0.60) 


All n s 


Ho 






< 76 


< 68 


70 ± 4 (Q vac « 0.66) 


15 Gyr Vt vac sequence, 


with GW 








#o=43 
#o=70 


n s 
n s 


1.021^ 

i-28l : i 


0.931$ 

o.89l:8| 


1-041$ 
1.021$ 


0-841$ 
1.251$ 


0.911$ 
1.261$ 


All # 


n s 






i-05!:il 


i.i7l:ll 


1.031$ 


n s =l 


Ho 


< 76 


< 51 


< 61 


54 ±3 


53 ± 2 (fl vac « 0.50) 


All n s 


Ho 






< 63 


< 57 


55 ± 4 (Q vac ss 0.52) 



Table 1: Some results for n s and #o from our analysis for various combinations of experiments 
as indicated. Upper limits and error-bars are "l-cr" ones determined using likelihood ratios C = 
max- In rows 1, 2 and 4 for each case, #o or n s is fixed and only as is marginalized. The other 
two marginalize over #o or n s as well as as- In all cases, we have fixed to = 13 Gyr, f^h 2 = 0.0125, 
tttot = 1 an d included tensor modes for n s < 1. Note that whether the small angle experiments are 
added to LSS+DMR4 or not has little impact on the n s (#o) estimation if #o (n s ) is fixed. 



DMR4+SP94+SK94-95 BEST FIT MODELS, with GW 



case 


#0 


n s 


^vac 


0-8 




13 Gyr B 


50 


1.15 





1.8 (1.24) 


(Q B = 0.013) 


13 Gyr B 


50 


1.0F 





0.9 (0.62) 


(n B = o.i7) 


15 Gyr A 


43 


1.05 





1.1 (0.76) 


(n B = 0.068) 


13 Gyr A 


50 


1.15 





1.3 (0.90) 


(n B = 0.05) 


13 Gyr A 


60 


1.0F 


0.43 


1.22 


(n B = 0.035) 


11 Gyr A 


59 


1.2 





2.2 (1.54) 


(fl B = 0.036) 


13 Gyr OPEN 


55 


1.05 


n tot =om 


1.0 


(n B = o.o4i) 


LSS + DMR4+ 


SP94- 


fSK9^ 


L-95 BEST FIT MODELS, with GW 


15 Gyr A 


55 


1.0 


0.52 


0.97 


(best fit) 


13 Gyr A 


65 


1.0 


0.56 


1.17 


(lo" down) 


11 Gyr A 


85 


1.05 


0.69 


1.49 


(2cr down) 



Table 2: "Best fit" models for various regions in the scanned parameter space. The top panel 
considers the combined likelihoods from the CMB experiments. In the <jg column, the numbers 
in parentheses are the equivalent values for a cold+hot dark matter universe with flhdm = 0.2. 
For the A cases, £l B is not varied separately, but fixed at il B h 2 = 0.125. The bottom panel 
adds in the prior information from large-scale structure and lists the best fit model for each 
age, as well as their relative likelihoods. F means the parameter value was forced to be the one 
indicated. 



upon the galaxy types being probed, upon scale, and could be bigger or smaller than <j$ . 

We want to choose priors for cr 8 , Q vac and v s that reflect these LSS ranges, but we certainly 
don't want to be too miserly in our choice of allowed ranges. A straight Gaussian tends to 
be overly supportive of the mean, while a tophat error has no probability in the wings. Using 
priors which convolve a Gaussian with a tophat and have different upper and lower errors give 
us the flexibility we require. It is similar to specifying both a statistical and a systematic error. 
For the exercise shown in the tables, we required that T + u s /2 be 0.221q;oJ1q;q7 and cr 8 f2^ r 56 
be 0.651 

aoiiaol- The latter has a high probability at 0.55, but little at 0.50 (although some 
authors actually prefer this value). Sample LSS+CMB numbers are given in Table [I]. 

The tiny error bars when LSS constraints are added to the CMB data are amusing, but are 
far from definitive at this stage. The reason the errors are small is typically that the CMB data 
pushes for a likelihood peaked at high cr 8 > 1, and this multiplies the LSS likelihood peaking 
at 0.6 or so. The product of the two has a narrow peak but also a small likelihood. This 
asymmetry is not as pronounced for the hot/cold hybrid models. 

Table |2| gives the parameters for the best fits to the data for the various cases. The associated 
C^s are shown in Fig. 0. Note that the models which best fit the CMB data for a given age often 
have positive tilts. While positive scalar tilt is possible in inflation models, it requires special 
constructs in the inflaton potential in a region corresponding to just where we can observe it 
with the CMB. More likely are negative tilts. If we restrict our attention to these (e.g., second 
row), then the best fit for the Qb sequence (13 Gyr, H = 50) is n s « 1 and VLb ~ 0.17, high 
not low. In the lower LSS part of Table |2|, the 13 Gyr best fit is one sigma down from the 15 
Gyr best fit, and the 11 Gyr is two sigma down. 

The analysis shows that n s lies close to the value predicted by inflation. The H limits are 
suggestive, but better CMB data is needed to strengthen the constraint to usable values. Of 
course when the LSS data are included, Q vac > is suggested for n s — 1 CDM models, though it 
is not needed for hot/cold hybrids with flhdm = 0.2Q nr . Table p] shows adding SK95 and SP94 to 
LSS and DMR4 does not add much further discrimination, but this should change dramatically 
in the next few years, with the advent of long duration balloon experiments, interferometers, 
MAP and COBRAS/SAMBA. 

We would like to thank Barth Netterfield and Lyman Page for helping us understand how 
to model their data and Lloyd Knox for useful conversations. 
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